New Limits on Fault-Tolerant Quantum Computation 
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Abstract 

We show that quantum circuits cannot be made fault-tolerant against a depolarizing noise 
level of 8 = (6 — 2\/2)/7 m 45%, thereby improving on a previous bound of 50% (due to 
Razborov |IE])- Our precise quantum circuit model enables perfect gates from the Clifford group 
(CNOT, Hadamard, S, X, Y, Z) and arbitrary additional one-qubit gates that are subject to 
depolarizing noise 8. We prove that this set of gates cannot be universal for arbitrary (even 
classical) computation, from which the upper bound on the noise threshold for fault-tolerant 
quantum computation follows. 

1 Introduction 

In the past decade, quantum computing has attracted much attention because of its ability 
to efficiently solve problems for which no efficient classical algorithms are known. Significant 
research efforts are dedicated to physically realizing quantum computers. A fundamental prob- 
lem is to cope with noise, which creates major difficulties in storing and operating on quantum 
states reliably. A key advance was the realization that quantum error correcting codes |23 1^| 
exist and fault-tolerant quantum computation |21j is possible for a number of reasonable error 
models. Subsequent results have improved on the first fault-tolerant schemes, proving better 
and better bounds on the noise tolerable in quantum computation (e.g. [HI El)- Recent results 
suggest that fault-tolerant quantum computation is possible with gates that have as much as 
3% of depolarizing errors |12) . but there is no rigorous proof so far. 

In this paper we will concentrate on the opposite task of proving that, for certain noise levels, 
quantum computation is impossible. Our main result is as follows: Let Clifford be the set of 
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The Gottesman-Knill Theorem says that this set of gates can be efficiently simulated classically 
(see also pQ), so they are probably not universal for quantum computation. On the other 
hand, it is known that Clifford together with any other one-qubit gate, not generated by 
the gates in Clifford, form a universal set of gates for quantum computation [231 114] . We 
show however, that such additional one-qubit gates should not be too noisy. More precisely, let 
Clifford* be Clifford augmented with arbitrary one-qubit gates with depolarizing error at 
least 9 = (6 — 2\/2)/7 ~ 45%. Then this set of gates is no longer capable of computing arbitrary 
functions and thus is not universal. In other words, fault-tolerant quantum computation cannot 
be performed if there is this level of noise. Additionally we show that, among all one-qubit 
gates that augment Clifford, the so-called 7r/8-gate (see end of Section EJ) is the type of 
gate that requires the most noise to render it incapable of universal quantum computation by 
our approach. That is, for other augmenting gates (e.g., 7r/16-gates), our approach will yield 
stronger bounds on the tolerable level. Our results also yield a simple proof that not all classical 
functions can be computed using Clifford gates (complementing results in [T]). In particular, in 
Corollary ^ we show that a boolean function which can be computed by Clifford circuits can 
be written as the parity of a subset of input bits. 

The main idea of our approach is as follows. Assume we have a Clifford circuit C with n 
classical input bits x = Xi, . . . ,x n and one dedicated output qubit that, when measured in the 
computational basis, yields the output of the computation of C on x. Suppose now that the 
input is partitioned over two parties, Alice and Bob, such that Alice has k bits of x and Bob has 
n — k bits. We first show how Alice, with the help of Bob, can compute the value of C on x with 
just a single classical bit of communication (Lemma^]). From this it follows that Clifford circuits 
can at the very best compute only those functions that require for any partition of the inputs 
a single bit of communication, and it is well known that many functions require more than one 
bit of communication. Next, we show in Lemma [21 how probabilistic mixtures of Clifford gates 
can be used to simulate any single qubit unitary gate, that has noise 0{~ 45%). The proof of 
our Lemma relies on solving an optimization problem related to the Clifford polytope, defined 
as the convex hull of the set C C R 3x3 of Clifford rotation matrices in R 3 . Here, the matrices C 
are essentially the one-qubit Clifford gates in Bloch sphere representation. 

Combining LemmasnandEl we get that all circuits with CLiFFORD*-gates and with respect 
to any distribution of the inputs can be computed by Alice and Bob with a single bit of com- 
munication (Lemma El). Using the fact that there are functions which require communication 
more than one bit, we get our main result (Theorem^: The set of gates in Clifford* cannot 
be universal. We also generalize our result to the case that the inputs are quantum states. 

The idea that a noisy 1-qubit gate can be simulated by a probabilistic mixture of Clifford 
appeared first in Virmani et al. (25]. The approach we take here though is an extension to 
quantum fault tolerant computation of the work by Brassard et al @|, where they exhibit an 
upper bound on the noise threshold for classical fault tolerant computation, using lower bounds 
on quantum communication complexity and the non-local CHSH correlation. 

We want to point out that section can be read independently of the preceding section. 
It shows that gates from Clifford*, together with all stabilizer operations and classical co- 
processing are classically simulatable and thus probably not quantum-universal. 



1.1 Related Work 

There are only a few other results concerning the limits of fault-tolerant computation. These are 
not all strictly comparable to each other and our result; nevertheless, we review them and make 
some comparisons. See the introduction of |18| for some remarks that motivate the analysis of 
thresholds for fault-tolerant quantum computation. 

The first results on upper bounds of the threshold decoherence rate were obtained by showing 
that quantum computers with faulty gates can be simulated efficiently on a classical computer. 
The first to prove one of these results were Aharonov and Ben-Or [2] , with the value 97% for 
the noise. Later Harrow and Nielsen ^J showed that if 74% of depolarizing noise is applied 
to each output qubit of each gate, then (faulty) two-qubit gates cannot produce entanglement. 
They concluded that circuits containing only one- and two-qubit gates with depolarizing noise 
at least 74% can be simulated efficiently on a classical computer. 

An improvement of this is due to Virmani et al. 25 who show that the set consisting of 
CNOT with depolarizing noise at least 67% and arbitrary 1-qubit gates is efficiently simulable 
classically. In this paper they also introduce the interesting idea that sufficiently noisy 1-qubit 
gates can be simulated by Clifford gates; we build on and extend this idea in this paper. We note 
however, that their strongest results are for a restricted class of gates (ones which are diagonal in 
the computational basis) and dephasing or worst-case noise. They prove that (y/2— l)/v2 ~ 29% 
dephasing noise is enough to make these diagonal gates a mixture of Clifford operations 1 . We 
extend their results by considering all 1-qubit gates. Note also that dephasing noise is only 
symmetric around the z-axis, which is natural when considering diagonal gates. Our noise 
bounds are with respect to depolarizing noise, which is symmetric in all directions, and hence 
appropriate when considering arbitrary one-qubit unitaries. 

Note that all these results do not exclude the possibility that quantum circuits with high 
noise can still do universal classical computations; our results imply this. 

The only prior result of this latter type is due to Razborov ^S]> where a 50% upper bound on 
the noise threshold for depolarizing noise on qubits for circuits with two-qubit gates is obtained 
(and a weaker bound for fc-qubit gates). The argument in ^2] is essentially that, at this noise 
level, any superlogarithmic-depth quantum circuit (with constant error rate per qubit per time 
step) will be overwhelmed by the noise and produce a statistically meaningless outcome. Thus, 
under the complexity theoretic assumption BQP ^ QNC 1 , there are sets in BQP which can be 
computed with this noise level. We note that it is shown in [7] that in fact log-depth quantum 
circuits can perform interesting feats, including efficient integer factorization (if combined by 
classical polynomial-time pre- and post-processing) . Our error model is in most respects weaker 
than that of ^Hj (since our qubit errors are only occurring at the completion of non-CLiFFORD 
gates) and our bound of « 45% is below 50%. In fairness, there is a sense in which the bound in 
[T%1 is stronger: it permits arbitrary (noisy) two-qubit gates; whereas, our only two-qubit gates 
are (perfect) CNOT gates. 

Finally, we note that our work is related to, and partly stimulated by, the circle of ideas 
surrounding measurement-based quantum computation that was largely initiated by |l()l ITTj . 

2 Preliminaries and Notation 

Eij is the all-zero matrix, except for the entry i,j which is equal to 1. We also write + for +1 
and — for —1. For matrices A,BG R 3x3 we define the inner product (A, B) as: 

(A, B) = tr(A T B) = J^ a ^K- 
i,ie{i,2,3} 



They define dephasing noise aspn l/2(p + ZpZ). 



The following fact is used repeatedly: (A, BC) = (B T A, C) for A, B, C £ K 3x3 . 

By A* we denote the conjugate transpose of matrix A. 

An n-qubit state (or density matrix) p is a matrix p £ C 2 x2 with the properties tr(p) = 1, 
p = p* (Hermiticity) and p is positive semi-definite. An n-qubit operation (or gate) is a unitary 
matrix £/ 6 C 2 x2 , i.e., ?7*?7 = I. For such n-qubit state p and n-qubit operation U the 
application of U to p results in the state UpU* . 

2.1 Bloch-vector representation 

In our further analysis it will be convenient to use the Bloch-sphere representation of 1-qubit 
states and 1-qubit operations, which we review now (see e.g. Section 4.2 and Chapter 8 in jlftj). 
For r £ R 3 define r ■ a — r x X + r y Y + r z Z, where a — (X, Y, Z) is the vector of Pauli 
matrices 

*-(! ') r-(° 7) z=(l °V P> 



Then, all 1-qubit density matrices p can be uniquely written in the form 

_I + r • a _ I + r x X + r y Y + r z Z 



where reR 3 and ||r|| = wr 2 + r 2 + r 2 < 1. We call r the Block vector of p. 
For net 3 with ||n|| = 1 and 9 £ R we define 

U a (6) = exp(-i(9n ■ a/2) = cos(0/2)I - i sin(0/2)n • <r. 

We first note that U n (9)U n (8)* = I, i.e., U n (8) is unitary. Second, let the result of the quantum 
operation U n {9) applied to state p = l/2 + r-a/2 be p' = U n {9)* pU n {9) = I/2 + r' -a/2. Then r' 
is the image of rotating r around n by an angle 9. Third, all 1-qubit unitaries U can be written 
as 

U = U n {9) 

with n £ M. 3 ,9 £ R and ||n|| = 1 (ignoring an unimportant phase factor a £ C with |a| = 1). 

Thus, one-qubit states and unitaries are isomorphic to vectors ,resp., rotations in R 3 . The 
set of all rotations in R 3 is the group SO(3). 2 We introduce some notation reflecting this 
isomorphism. For unitary U £ C 2x2 we let Rjj £ SO(3) be the corresponding rotation matrix. 
We get a reverse operation (up to phase factors) by fixing one mapping / : 5*0(3) — > C 2x2 with 
the property that for all unitary U £ C 2x2 it holds f(Rjj) = ctU for some a £ C, \a\ = 1. We 
then write Ur = f(R). 

This can be extended to probabilistic mixtures of quantum operations. Let {pi} be a prob- 
ability distribution, i.e., Y^iPi = 1 an d — Pi> an( ^ l et Ui e C 2x2 be a 1-qubit unitary with 
corresponding Bloch representation Ri £ R 3x3 . Then the quantum operation E in which each 
Ui is applied with probability pi has Bloch-representation Re = ^2iPiRi- 

2.2 Noise 

There are several models of noise considered in the literature. The most common one, which we 
consider too, is depolarizing noise. A 1-qubit state p to which depolarizing noise p is applied, 
becomes 

(l-p)p + P I/2. 

Thus, with probability 1—p the state is not changed and with probability p the state is replaced 
with the completely mixed state. 



2 This group will play a prominent role in the proof of Lemma|21 where some more notation can be found. 



It is not hard to see that applying depolarizing noise p to p = 1/2 + r • cr/2 yields p' = 
1/2 + r' • cr/2, with r' = (1 — p)r. So, this noise shrinks the Bloch vector of a state to (1 — p) of 
its original length. 

We say that a 1-qubit gate implements the unitary U with noise p if it transforms states p 
into 

(l-p)U P U* +pl/2. (3) 

This quantum operation can be seen as a two-stage process, in which first U and then depolar- 
izing noise is applied. Let Ru £ R 3x3 be the rotation matrix corresponding to the unitary U. 
Then this noisy quantum operation has Bloch-representation (1 — p)Ru, i.e., it rotates a Bloch 
vector and scales it by a factor 1 — p. 

For 1-qubit gates and depolarizing noise, the two representations are (up to unimportant 
global phase factors) equivalent. (See Section 8.3 in ^] for more details.) 

2.3 Clifford group 

The (n-qubit) Clifford group contains all unitary operations that can be written as a product 
of tensor products of S, H and CNOT (see Eq. Q). The Clifford group contains also all Pauli 
operators X, Y, Z. We let Clifford be the set of all Clifford gates. Let Clifford* be the set 
of gates consisting of Clifford and arbitrary 1-qubit gates which have depolarizing noise at 
least 6 = (6 - 2\/2)/7. 

For a state with Bloch vector r we get: 

s (}i l+r i x + r -i Y+r i z ) s - = \ l -¥ + r i Y + r i z 

Let Rs be the Bloch representation of S. Then R$ rotates Bloch vectors around the z-axis by 

7r/2. In particular, the x-axis is mapped to — y and y to x. For the Hadamard-gates we similarly 

have 

/ 1 r T r„ r 2 \ „ 1 r, r„ r T 

H -I + —X + -^-Y + —Z )H* = -I + —X - -0-Y + —Z. 

V2 2 2 2 J 22 2 2 

So the Bloch representation Rh of H negates the y-coordinate of a Bloch vector and swaps the 
x and z-coordinates, i.e., it is a rotation by 7r around the axis l/v2(l,0, 1). 

We define C as the set of matrices which can be generated from Rs and Rh- A C £ C is 
called a Clifford (rotation) matrix. It is not hard to see that C contains exactly those rotations 
which map axes to axes (or their opposite). Those C have in each row and column exactly one 
non-zero entry, which must be either +1 or —1, and det(C) = 1. Note that C, being isomorphic 
to the 1-qubit Clifford group, is a group under matrix multiplication. Examples of Clifford 
matrices are 

' 1 \ / 1 0\/10 

10,0-1 1,0 1 

1/ \ -1 / \ -1 

In Appendix 1X1 we need (and explain) more details about Clifford rotation matrices. 

2.4 Communication Complexity 

The setting for this is the following: Assume two separated parties, Alice and Bob, where Alice 
is given x € {0, l} mA and Bob y £ {0,l} mb , want to compute f(x,y) for some fixed function 
/ : {0,1}™ A x {0,l} m£J — ► {0,1}. We want that at least one party learns the result f{x,y). 
In order to achieve this they can communicate bits, according to a predefined protocol. The 
deterministic communication complexity Ct(n) of / is the smallest number c such that each 
protocol which always computes the correct result, needs at least c bits of communication for 
at least one input x,y £ {0, 1}". 



It is well-known that there are functions / where Cf(n) is n, for example the inner product 
function (see |13|h The above can be extended to randomized communication, where the parties 
are additionally provided with a source which sends a sequence of random bits to Alice and the 
same sequence to Bob. The final result only has to be correct with some probability 1 — e for 
e < 1/2. The minimum number of bits needed to be communicated such that the output is 
correct with probability at least 1 — e is denoted by CS(n). However, also in this randomized 
setting there are "hard" functions. For example, it is known that the inner product function 
has randomized communication complexity n — 0(log(l/i5)), if the outputs have to be correct 
with probability at least 1/2 — S (see also |13|). 

We will also slightly abuse notation and define communication complexity for functions 
/ : {0, 1}" — > {0, 1} which depend only on one input string. For any S C {1, . . . , n} let C/(n, S) 
be the communication complexity of / if the bits with indices in S are given to Alice and all 
others to Bob. We then set Cf(n) = niaxjc^ ^ n j Cf(n, S). 

3 The power of Clifford circuits 

We are now ready to prove the main lemma, which explains the idea of simulating Clifford 
circuits. 

Lemma 1. Let f : {0, 1}™ — * {0, 1} be a function that is computable with unbounded error 3 by 
a quantum circuit C that uses only gates from Clifford , ancillas initialized to |0) and one 
single-qubit measurement in the computational basis, which determines the output. Then the 
deterministic communication complexity Cf(n) is at most one bit. 

Proof. We begin by noting that each qubit can be represented by two shares: a classical share 
consisting of two bits, and a quantum share consisting of one qubit . When the classical share 
is ab and the quantum share \ip), then the logical qubit that the shares encode is X a Z b \i/j). 

Assuming that a set of qubits is encoded in this manner, the operations H, S, and CNOT 
can be applied to the logical qubits by separately performing operations on the shares that 
encode them (i.e., the logical qubits do not have to be reconstructed). The reason why this 
works is because for any Clifford operation C — H,S. CNOTj and any tensor product of Pauli 
operators P\ there is a tensor product of Pauli operators Pi with CP\ = P^C. For example, to 
apply H to a logical qubit, the two bits that make up its classical share are swapped and H is 
applied to its quantum share. This works correctly because 

HX a Z b \iP) = HX a HHZ b HH\iP) (4) 

= Z a X b H\ip) 
= (-l) aAb X b Z a H\tp), 

and (— l) aAfc is an irrelevant global phase. 

To apply S to a logical qubit, the 6-part of the classical share is updated to b := a © b and 
S is applied to its quantum share. This case can be verified by noting that 

SX a Z b \ip) = i a X a SZ a Z b \iP) (5) 

= i a X a Z a ® b S\ip), 

where we note that i a is a global phase. 

To simulate the application of CNOT2 gate 4 on two logical qubits, with classical shares a\b\ 
and a2&2, we update a 2 := a\ © a%, b\ :— b\ © b 2 and CNOTj is applied to the two quantum 



3 That means, that the output is only correct with probability greater than 1/2, but can go arbitrarily close to 1/2 



as n grows. 
4 control 



shares. In this case, we omit the details but note that the correctness can be verified using the 
identities 

CNOT*(A ® I) = (X <Z> X)CNOTl, (6) 

CNOT^(I®X) = {I®X)CNOTl 

CNOTl(Z®I) = (Z®/)CNOT^ 

CNOTl(I®Z) = {Z®Z)CNOTl 

We first describe a probabilistic communication protocol for /. Alice operates on the classical 
shares while Bob operates on the quantum shares. The initial shares are easy to construct: for 
each of Alice's input qubits \xj), Alice sets her classical share to aj :— Xj,bj := and Bob 
sets his quantum share to |0); for each of Bob's input bits yj, Alice sets her classical share 
to a,j = bj := and Bob sets his quantum share to \yj). In this manner, Alice and Bob can 
simulate the execution of circuit C on input |x)|y)|0 . . . 0) without any communication to obtain 
the shares of the output qubits of C. For Bob to obtain the measured output qubit, Alice 
sends the first bit of her classical share, aj., to Bob, who applies X ai to his quantum share and 
measures it (Alice need not send &i, the second bit of the classical share, since Bob is performing 
a measurement in the computational basis). 

Finally, to obtain a deterministic communication protocol for /, we note that Bob need not 
actually manipulate quantum information; rather, he can simulate his quantum registers and 
his operations with high enough precision on a classical computer. Then, upon receipt of the 
classical bit from Alice, he can exactly determine the output probabilities of his measurement 
to determine which outcome is more likely. □ 

From Lemma ^ we get that the set of functions computable with Cliffords gates is very 
limited and far from being universal. 

Corollary 1. Every function f : {0, 1}™ — ► {0, 1} which can be computed by a Clifford circuit, 
can be written in the form 

f(Xi .. .X n ) =C®Qxj, 

jes 
where S C [n] is a subset of the input bits not depending on the input bits and c € {0, 1}. 

Proof. Let / : {0, 1}" — ► {0, 1} be a function which can be computed by a Clifford circuit C. 
Then we can simulate this circuit as in Lemma Q] where we give Alice the whole input, i.e., 
nriA = n and tub = 0. 

Inspecting the proof of LemmaQ]wc sec that in each step Alice always updates her afs and 
frj's by computing the parity of two bits. So, the final bit she sends over, say a,, is just the parity 
of some of the input bits. Thus we can write a, = © 7 - e s Xj, for some S C [n]. Bob initializes 
all his quantum bits to |0), so he starts with the state \tp°) = |0 . . . 0). Further, Bob just applies 
the circuit C to his state and measures the i-th qubit of X ai C\ip°) in the computational basis. 

It is known that the probability for measuring 1 in a Clifford circuits is either 0, 1/2 or 1 
(see ^3] page 463). It cannot be 1/2 in our case, because that would mean that the circuit does 
not compute /. So, measuring the i-th bit of C\ip°) yields a bit c € {0, 1} with certainty. But 
this means that f(x) = c ® ai = c ® (B 7e s x i- ^ 

We mention that Aaronson and Gottesman proved pQ that there is a log-space machine which 
transforms a Clifford circuit C into a classical circuit C consisting only of CNOT and NOT 
gates, with the property that C accepts the all zero state |0)® n iff C accepts the (classical) 
all zero input. Our corollary extends this slightly: For every Clifford circuit C computing a 
boolean function, there is an equivalent (for classical inputs) classical circuit which uses only 
NOT- and CNOT-gates. Using the result from pp we see that we can compute the bit c in the 

7 



proof of Corollary ^ in log-space and it is also clear that the circuit Alice uses to compute a, 
can be computed in log-space. 

Quantum inputs LemmaQJcan actually be extended to she case where Alice and Bob get 
quantum states as inputs and they are provided with entanglement. It is no problem for Bob to 
start with a quantum state as an input. For Alice we do the following. We let her teleport her 
quantum input to Bob bit by bit, using the standard scheme for teleportation (see e.g. |15j). 
When Alice teleports a qubit, which corresponds to the i-th input qubit of the circuit C to be 
simulated, she measures two classical bits. Now, if she does not send these to Bob, but rather 
initializes her cii,bi with these bits, Alice and Bob obtain the correct representation for qubits 
of C as in Lemmas Since the inner product function has communication complexity fl(n) even 
in the presence of entanglement [H] we see that Theorem Q is also true for quantum inputs. 

Note however, that now Bob can no longer compute the result of the circuit with certainty, 
because he does not know his input state. The correctness probability will only be the same as 
that of the simulated circuit C itself. 

Remark 1. It is straightforward to extend these results to functions with m output bits, if the 
communication complexity of the function is also higher than to, resulting in a scheme that uses 
to bits of communication. 

4 Simulating unitaries 

We want to extend Lemma El by replacing Clifford with Clifford*. To do that we first 
show how one can simulate arbitrary 1-qubit gates with depolarizing noise = (6 — 2\/2)/7 with 
a probabilistic mixture of Clifford operations. 

Lemma 2. Let U be a 1-qubit unitary and Ejj be the following noisy version of it 

p h+ Eu{p) = (1 - §)UpU* + 01/2, 

for any p G C 2x2 . Then there is a probability distribution {pc} overC such that for all p G C 2x2 
we have 

E u {p)^Y. r PcVcpV* c 
cec 

and Uc is a Clifford operation corresponding to the Clifford rotation matrix C . 

Proof. Using Section 12. II the lemma can be reformulated equivalently in Bloch representation: 
For any 5 G 50(3) there is a probability distribution {pc} over C such that 

(i-§)s = J2pcC (7) 

cec 
We will prove this latter statement. Define the Clifford poly tope 



P := conv(C) = J 5 | 5 = £ p c C,p c > 0, ^> C = 1 1 
I cec cec J 



(8) 



as the convex hull of the 24 Clifford rotation matrices in R 3x3 . We have to prove 

(1-6)S€P for any S G 50(3). (9) 

For this we use the fact that the Clifford polytope can be alternatively described by its facet 
description: 

P= {5GM 3x3 I (F,S) < 1 for all Fg T} , (10) 



where 

T := {C\BC 2 \C 1 ,C 2 EC, Be {B±,B{, B 2 }} , (11) 

/ 1 \ / 1 -1 

B x := 1 , B 2 := 1 1 
\ 1 J \ -1 

One can use the software from [H] for computing the facet description (|1C)|) : we will give a direct 
proof in the Appendix. In view of 1)10(1 , our claim is equivalent to 

(1 - §){F, 5) < 1 for all 5 e 50(3), Fef. (12) 

Let F e T of the form F = C X BC 2 where C u C 2 € C. As (F, 5) = (CffSCg, B) and Cj SC^ € 
50(3), {T2J is equivalent to 

(5, B) < — ^ = 2^2-1 for all B G {Bi, B 2 }, 5 e 50(3). (13) 

1 — 

The case B = B x is easy to handle: For 5 e 50(3), (5, B x ) = ^Li ^i < v 7 ^ < 2\/2 - 1. We 
now show (H3J for B = B 2 . Write 5 € M 3x3 as 

(ai a 2 a 3 \ 
6i 62 63 I - (14) 

Cl c 2 c 3 / 

Well-known necessary and sufficient conditions for 5 € 50(3) are 

a T b = 0, c = a x b, a T a = 1, b T b = 1, (15) 

where x denotes the vector product, defined as 

a x b := (a 2 b 3 - a 3 b 2 ,a 3 bi - aib 3 , aib 2 - a 2 bi) T . 

Recall that, for a, b, c as in i|15|) . a — b x c and b = c x a. Using c 3 = a\b 2 — a 2 b\, we obtain 
(B, 5) = ai — a 2 + b\ + b 2 — a\b 2 + a 2 b\. Therefore our task is now to prove that the optimum 
value of the program 

max / := a,\ - a 2 + b\ + b 2 — a\b 2 + a 2 bi 

s.t. .,, -.., -. ■•■= - ■ (16) 



3i 


= a\ + a\ + a\ = 1 


9i 


= b\+bi+bi=\ 


93 


= a\b\ + a 2 b 2 + a 3 



is at most 2y2— 1; we in fact show that max/ = 2y2— 1. For this, consider a global maximizer 
(a, b) to the program ((16JI . Then, the Karush-Kuhn- Tucker conditions have to be satisfied, since 
the gradient vectors {Vgi(a, b) \ i = 1, 2, 3} are linearly independent; see, e.g., Theorem 12.1 in 
|lfi| . (Here, the gradient vector Vgt(a,b) consists of the partial derivatives with respect to the 
six variables oi, . . . , 63.) That is, there exist scalars Ai, A2, A3 for which 

V/(o,6)+ J2 AiV ffi (a,6) = 0. 

i=l,2,3 

Equivalently, considering the partial derivatives first with respect to (ai,a 2 ,a 3 ) and then with 
respect to (61,62,63) 

I I-62 \ 

+ 2Aia + A 3 b = 



2A 2 b + A 3 a = 0. 




Multiplying the first and the second line by c T = (a x b) T (recall that c 1 a, b) we get 

= Ci(l — 62) + C2(— 1 + bi) = a - c 2 + a 3 
= ci(l + a 2 ) + c 2 (l - ai) = ci + c 2 + 63. 

Adding (resp. subtracting) these equations yields 2ci = —0,3 — 63 and 2c 2 = 03—63. Squaring 
these two equations and then adding them gives 2a\ + 2b\ = Ac\ + 4c 2 . Since the rows and 
columns in S are normalized, we get 2(1 — C3) = 4(1 — 03), f rom which we conclude c\ = 1 and, 
therefore, 03 = 63 = ci = c 2 = 0. This implies a\ + b\ = 1 = of + a\ and thus |6i| = |a 2 |. 
Similarly one can establish |ai| = |6 2 |. On the basis of this observation we distinguish three 
cases. 

1. ai = b 2 = 0. Then, |a 2 | = |6i| = 1 and / = — a 2 + 61 + a 2 b\ < 1. 

2. ai 7^ and ai = — 6 2 . From a T b — we have ai(&i — a 2 ) = 0, which gives a 2 = b\. Then, 
/ = ai - a 2 + a 2 - a x + a{ + a 2 = 1. 

3. ai 7^ and a\ = 6 2 . From a T 6 = we have a\(b\ + a 2 ) = 0, which gives a 2 = —61. Then, 
/ = a± — a 2 — a 2 + ai — a\ — a 2 = 2(ai — a 2 ) — 1, which (under the condition a\ + a 2 = 1) 
is clearly maximized by a\ — — a 2 = l/\/2- Therefore, we find max/ = 2y2 — 1. 

Thus, we have shown that the optimum value of the program l|16|l is equal to 2\/2 — 1, which 
concludes the proof. □ 

Lemma 3. Let f : {0, l} mA x {0, l} mB — » {0, 1} be a function and K a quantum circuit for f 
with correctness probability c > 1/2 which uses only gates from CLIFFORD* and measurements 
in the computational basis. Then the randomized communication complexity of f is at most one 
bit. 

Proof. From Lemma ^ we know how two parties, Alice and Bob, can simulate perfect Clifford 
gates. From Lemma |21 we know how they can jointly simulate the other noisy 1-qubit gates 
in Clifford*, where they use shared randomness to make sure that they always simulate the 
same Clifford gate. □ 

We can now prove an upper bound on the noise in fault-tolerant quantum computation. 

Theorem 1. The set of gates from Clifford together with 1-qubit gates with depolarizing 
noise more than 8 and one single- qubit measurement is not sufficient for arbitrary classical 
computation. 

Proof. The result follows by Lemma[!|]and the fact that there are functions with communication 
complexity greater than 1. In fact we have that none of the functions with unbounded error 
communication complexity C t i nhounded > 1 can be computed. □ 

5 Discussion and extensions 

Best gates From the proof of Lemma we see that the rotation matrix S which achieves 
the optimal value, is 

1/V2 -1/V2 \ 

1/V2 1/V2 . (17) 

-1 / 

Multiplying from the right by the Clifford-matrix diag(l, —1, —1) we get a rotation around the 
z-axis by 7r/4. The 7r/8-gate 

exp(-i7r/8) 

exp(i7r/8) 
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performs a rotation of 7r/4 around the z-axis. So, the 7r/8-gate and its symmetric versions are 
the ones which need the most depolarizing noise to be simulated by gates from Clifford. 

Worst case noise In Lemma we asked with how much depolarizing noise all 1-qubit 
unitary gates are equivalent to probabilistic mixtures of Clifford gates. Similarly to [25) one can 
also ask how much arbitrary noise is needed to make every gate a mixture of Cliffords. More 
precisely what is the value 9 = swpuesu(2) Pu > where pjj is the infimum of all p such that there 
is a completely positive trace-preserving 1-qubit quantum operation £jj with the property that 
the noisy implementation of U 

U' :p^(l-p)U P U* +p£u(p) 

becomes a probabilistic mixture of Clifford operation. 

In this section we will provide some bounds on 9. Let K £ SU(2) be any operation that in 
Bloch representation maps the state X-eigenstate vx = (1,0,0) to u = -t?(1, 1, 1)- Note that 
a probabilistic mixture of 1-qubit Clifford operations C = ^2iPiCi can map vx only into the 
octahedron O spanned by vx = (1,0,0), vy = (0,1,0) and vz = (0,0,1) and their negatives 
— vx, — vy, —vz (see also p])- Note that the state of O which is closest to u is |(1, 1, 1) = -j=u 

and their distance is \\u — l/ySu\\2 = 1 tk- The Bloch-state which is furthest away from u 

is —u. All three of these states lie on a line. With this it is clear that the state u no i se which 
needs the smallest noise p, such that (1 — p)u + pu no i se is inside the octahedron is — u and the 
optimal p is |(1 - ^=). This implies 21% « |(1 - ^=) < 0. 

To get an upper bound, recall that by Lemma|21for any gate U £ SU(2) the operation 

U' :/)h {l-p)UpU* +pl/2 

is a Clifford operation, if p > 6. Setting £ v {p) = \ (XUpU*X + YUpU*Y + ZUpU*Z) and 
noting that for any 1-qubit density matrix it holds 1/2 = \(p + XpX + YpY + ZpZ) we can 
rewrite the action of U' also as 

U' : p ~ (1 - \p)VpU* + \p£u{p). 

Thus < j9 m 34%. Note that this is certainly not tight, since all gates, apart from the ir/8- 
gate (and its symmetric versions), need less than 9 depolarizing noise to make it a probabilistic 
mix of Clifford operations, which implies they need less than j9 worst case noise. However, as 
follows from [5S], the worst case noise for the 7r/8-gate(s) is only i K= w 15%. 

We leave it as an interesting open question to determine the precise value of 9. 

Classical Co-processing Theorem ^ states that fault tolerant quantum computing is not 
possible if we have depolarizing noise at least 9 w 45% on one qubit gates even if we can use 
perfect gates from Clifford in our fault tolerant circuit design. Is this optimal? Could it be 
that with less than 9 noise on the single qubit gates and perfect gates from Clifford still no 
fault tolerant circuit design is possible. We leave this as an open question, but Ben Reichardt ^Hj 
pointed out that when we allow perfect classical computation in addition to perfect gates from 
Clifford and perfect measurements in the computational basis, for any quantum circuit one 
can build a fault tolerant quantum circuit, that tolerates noise less than 9 on single qubit gates. 
This fault tolcrcnt implementation has only a constant factor slowdown in time. 

The argument builds upon magic-state distillation, introduced in [S], and goes as follows. 
Assume we have at our disposal noisy 7r/8-gates X", with depolarizing noise strictly less than 9, 
i.e. T'{p) = (1 -p)TpT* +pI/2 with p < 9, where T is the perfect tt/8 gate. Then apply T to 
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the second half of an EPR-pair and measure the observable Z®Z, which can be implemented as 
a measurement in the computation basis with additional gates from Clifford. If the outcome 
is —1 throw away the state and do the experiment again. If the outcome is +1, apply a CNOT 
from the first to the second qubit, which gives 

1/ 1 — pi 1 — v 1 „\ , . , , , . 

-.X + - £-_y ®|o}(0|. (18) 



2 V l-p/2y/2 l-p/2y/2 

Using the result from [20] an arbitrary supply of qubits in the state of the first qubit of 1|18|) 
can be used to distill magic states in the iJ-direction, which together with stabilizer operations 
is sufficient for quantum computation. We do not know if this also holds for other than the 
7r/8-gate. 

Note the this is tight for the 7r/8-gate, since stabilizer operations (Cliffords, measurements in 
the computational basis and classical co-processing)together with 7r/8-gates with depolarizing 
noise 9 can be efficiently simulated classically, as follows from our Lemma|21and the Gottesman- 
Knill Theorem. 

Allowing some perfect unitaries Our threshold theorem says the following. Let / be 
a function such that it requires more than one bit of communication in order to compute it, 
when the input bits are partitioned over Alice and Bob. There is no quantum circuit consisting 
of perfect Clifford operations and single qubit gates with noise 9 (« 45%) that can compute /. 
We can strengthen this result to allow a small amount of perfect single qubits as well. Assume 
that / requires m bits of communication to be computed. There is no quantum circuit that 
uses perfect Clifford operations, s perfect single qubit gates, and single qubit gates with noise 
9 that computes /, for 2s + 1 < m. The reason we get this strengthening is because in our 
simulation, Lemmas 1 and 2, Alice sends to Bob whenever he wants to perform a perfect single 
qubit gate on some qubit, her classical share a and b of that specific qubit. Bob can now perform 
the perfect qubit gate on that qubit and they proceed as in Lemma 1 and 2. By the end of the 
simulation Alice has sent 2s + 1 bits to Bob and he will be able to compute /, contradicting 
that the communication complexity of / is at least m > 2s + 1. 
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A Computing the facets of the Clifford polytope 

We give here the facet description for the Clifford polytope P defined in (jHJ as the convex hull 
of the set C consisting of the 24 Clifford rotations of the 3-space. Define the polyhedron 



Q:={S£ 



(3x3 



I (F,S) < 1 for aUFgf}, 



where T is as in (|ll(l . Our objective is to show the equality P — Q. 

To start with, let us prove the easy inclusion PCQ. For this, let C £ C and F £ J- of the 
form F = dBC 2 with C U C 2 £ C and B £ {B u Bl,B 2 }. Then, (F,C) = (B,Cf[CC%). As 
CjCCl £ C, it suffices to verify that (B, C) < 1 for any C £ C and B = B 1} B 2 . (Wc have 
used here the fact that C is a group which is closed under transposing matrices.) For C £ C, 
the inequality (B\,C) < 1 is obvious and the inequality (B 2 ,C) < 1 can be checked by direct 
inspection. 

The reverse inclusion Q C P follows from the following result. 

Theorem 2. Any facet of the polytope P is defined by an inequality of the form (F, S) < 1 
where F £ T . 

The rest of the Appendix is devoted to the proof of this result. We first need to go in more 
detail into the structure of the Clifford matrices. 

A.l Preliminaries about the Clifford matrices 

Each matrix C £ C corresponds to a "signed permutation" (cr, s), where a £ Sym(3) and 
s £ {±f} 3 . Namely, C has nonzero entries precisely at the (<r(i), impositions with C a u\i = s, 
for i — 1,2,3; we then also denote C as C CTiS . The condition det(C) = 1 translates into 
S1S2S3 = sign(cr); that is, s\s 2 s^ = 1 if a is an even permutation (i.e., one of <j\ :— (1, 2, 3), a 2 :— 
(2, 3, 1), 03 := (3, 1, 2)) and Sis 2 s 3 = — 1 if a is an odd permutation (i.e., one of 04 := (1, 3, 2), 
er 5 := (2, 1,3), (J 6 := (3,2, 1)). Thus the set C of Clifford matrices is naturally partitioned into 
six subclasses 



C = [J C ai where C a := {C CT ... 

o-SSym(3) 



£ {±1} 3 , S1S2S3 = sign(cr)} 



with I C ff I = 4. For convenience we display in the table below the six subclasses C a ; the nonzero 
entries are indicated by *. 





Even permutations 






Odd permuta 


ions 








/ 


* 





\ 






/ 


* 


\ 


Cl 


= (1,2,3) 


C ■ 1 








* 






* I 

* \ 


cr 4 = 


= (1,3,2) 


C ■ 1 




* 
* 


* 1 
J 

\ 


c 2 


= (2,3,1) 


c ■ 1 


* 





* 
* 




/ 
\ 


0-5 = 


= (2,1,3) 


c ■ 1 


* 






* / 


0-3 


= (3,1,2) 


c ■ 1 




* 






/ 


0~6 = 


= (3,2,1) 


c ■ 1 

°CT 6 ■ 1 


* 
* 



/ 



Table 1 

The following observation can be directly verified and will be useful for the proof. 

Observation 1. Let a £ Sym(3). Then, ^2 CeC C = 0. Moreover, for any position (o~(i),i) 
corresponding to a nonzero entry for matrices in C a and for d £ {±1}, there exist C, C" £ C a 
with C + C = 2dE c if j> i, which implies dE^u^i £ P. Thus, ±Eij £ P for any i,j = 1, 2, 3. 
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We now proceed with the proof of Theorem [21 Let (P, S) < b be an inequality defining a 
facet of P, where F E M 3x3 and b E K. That is, the inequality {F, S) < b is valid for P, which 
means that (F, S) < b holds for any S E P, and the set 

ft F := {C E C | (F, C) = b] 

contains nine affinely independent matrices. Without loss of generality, we may assume that 
6 = 1. Indeed, b > since £ P. Moreover, b > for, otherwise, we would have Py = for all 
i.j = 1, 2, 3, implying P = 0, in view of Observation^ Thus, by rescaling, we can now assume 
that the facet is of the form (P, S) < 1. We sometimes speak of the "facet P" for short. Our 
objective is to show that P = C1BC2 for some C\, C% £ C, B £ {Pi, Pf, P2j-- 

Call F,F' E M 3x3 equivalent if P' = C1PC2 for some C ll C 2 € C. Then, as C is a group, 
(P', 5) < 1 defines a facet of P if and only if (P, C) < 1 does. Moreover, K F > = C 1 TZ F C 2 = 
{C1CC2 I C € 72-f}- This property will be used repeatedly throughout the proof as it permits 
to exploit symmetry and to reduce the number of case checking. 

The proof is based on a detailed inspection of the structure of the set TZf- We begin with 
collecting several properties of the matrix P and the set TZf ■ 

Observation 2. \1Z F n C CT | < 3 for any a € Sym(3). 

Proof. If C a C TZf, then (P, C) = 1 for any CeC,, which implies 4 = X)cec (-^ ^)> contra- 
dicting the fact that X^cec C = by Observation^] D 

Observation 3. If Fij = d E { — 1, 1}, then all C £ C with dj = d belong to Rf- 

Proof. Let C £ C with Cy = rf. There exists C £ C with C + C = 2dPy. Summing (P, C) < 1 
and (P, C") < 1 yields (P, C + C") < 2. As (P, C + C") = 2dF tj = 2, we have the equalities 
(P, C) = (P, C) = 1, which implies C ETZp. U 

Observation 4. Let C ^= C E TZf H Co- (/or some cr G Sym(3) y ) and assume that C^u^i — 
C '<r(i),i = d G i 1 * 11 } /° r some i e I 1 ' 2 ' 3 )- T/ie ^ F *(i),i = d and F <y(j),j + sign(cr)dP .(fc) j fe = 
mtt{i,fe} = {l,2,3}\ {1}. 

Proof. Equality P CT (j),j = d follows from the fact that C + C = 2dE u u\ i . Then, 1 = (P, C) 
implies = F a(j) jC cr{j) j+F a(khk C lj{k ^ k . Using C .(j) ]i C a .( i)J C .( S .) !fe = sign(cr), we find P CT ( i)ii + 
sign(o-)dP CT(fc)!fc =0. □ 

Our last observation is an easy corollary of the former two observations. 

Observation 5. If P CT (i) j! : = d E {±1} (for some a E Sym(S)), then F a fj\j +sign(cr)dP cr ( fc ). fc = 
0, where {i,j, k} = {1, 2, 3}. 

One can verify that, for P = Pi, Pf , \TZ F n C CT | =2 for all cr e Sym(3) while, for P = P 2 , 
|7^f n Co- 1 =3 for a = <7i, (J5. Based on this observation we now distinguish two cases: Either, 
\TZf H Co| < 2 for all a £ Sym(3) (in which case we show that P is equivalent to Pi or Bf), or 
\TZf H Co I = 3 for some a E Sym(3) (in which case we show that P is equivalent to P2). 

A.2 The case \K F H C a \ = 3 for some a E Sym(3) 

Using symmetry, we may assume that \TZf nC CT | = 3 for the (odd) permutation a = (74. We 
prove this in detail to show how this kind of symmetry argument works. Define the matrices 

c, = I i) ■ I . r-2 = I (i + J . c, = I - 1 ci.!)) 
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lying, resp., in C CT4 , C CT3 , C a2 . Recall that +, — stand for 1,-1, respectively. Our assumption 
is that \1Zf H C ai \ = 3 for some i — 1, ... ,6; we show that one can replace F by another 
equivalent facet F' in such a way that i = 4 holds. For this, suppose first z = 2, 3. As the 
mapping X h-* ACi maps C CTi to C^, we can replace the facet F by F' :— Fd and then we 
find \1Zf' H C CTl | = 3 since 7\Lf' = IZfCi. Thus we may assume \1Zf H C CTl | = 3. As the mapping 
X h^ XCi maps C CT1 to C CT4 , replacing the facet F by F' := Fd, we find \H F , nC CT4 | = 3. Thus 
we can now assume |7£f H C CTi | = 3 for some i = 4, 5, 6. If i = 5, as the mapping X i— ► AC3 
maps C CT5 to C ai , replace F by F' := FC3; if i = 6, the mapping X 1— » AC2 maps Co- 6 to C ai and 
one can replace F by F' := FC2; in both cases we get back to the case when \1Zf> fl C CT4 1 = 3. 

Thus we now assume \R-f HC CT4 | = 3. Moreover, we may assume that the following matrices 
from C ai 

( + \ / - o o \ / - o o \ 

(20) 

belong to IZf. (To see this, replace if necessary F by FC, where C £ C ai .) Using Observation 
0J we obtain F\\ = — l,F^z = —1,^32 = 1. From this we get by Observation [3] that also the 
matrices 

/ - \ / - \ 

eC ffl , (21) 

eC CT2 , (22) 

- I e C CT3 (23) 

(24) 

belong to IZf- By Observation 0J we also obtain F22 = F33, F12 = F31 and F13 = —F21. 

Claim 1. There exists also an even permutation a G Sym(3) for which \1Zf nC CT | = 3. 

Proof. Assume for contradiction that, for i — 1,2, 3, the set KfllC ffi contains only the respective 
two matrices from (|21() - (|23ll . Choose a subset B C IZf consisting of nine affinely independent 
matrices and such that IZf H C ai Q B. We have \B n C CT1 1 < 1, since the two matrices in l|2*T|l 
are affinely dependent with the last two matrices in l|2()|l. Similarly, |i3nC CT2 | < 1, IBnCcrgl < 1. 
As \B\ = 9, we deduce that \B n C CT5 1 > 2 or \B D C CT J > 2. Assume first that |B n C CT5 1 > 2. Say, 
C 7^ C" € 72-f H C CT5 . Then C and C" have the same nonzero entry de { — 1,1} in some position 
(k, I). By Observation0]this yields F^i — d. Now, there is also an even permutation a for which 
k = a {I). By Observation |3| we then deduce that at least two matrices from C a must be in Tip, 
which contradicts our assumption. The other case |23nC CT6 | > 2 goes analogously. □ 

It is sufficient to consider the case \1Z-p n C ai \ = 3. Indeed, if \7Zf H C CT2 1 = 3, then one may 
replace F by C3FC4 with C3 as in ljl§|) and 

C 4 := 

since the mapping A 1— > C3AC4 maps C CT2 to C ai and preserves the set of three matrices from 
[J2UJ1 . as well as the set of 6 matrices from (J2U>-(ESl (namely, (J3TJ -* © -> Q -*• (EU)- One 
can handle the case when \1Zf H Co-j | = 3 in the same way. 
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The set IZf already contains the matrices 
from C ai (displayed in H21J I). The remaining two matrices of C ai are 







If D4 <E IZf, one may replace the facet F by F' :— D2FD1 to obtain that Di,D^,Dz € TZf 1 , 
since the mapping X 1— ► D%KD\ maps {DijDa,^} to \D\,Dv,D{\ and leaves the set of 3 
matrices from (12011 invariant as well as the set of 6 matrices from l|21 [l -l|23 jl . Thus we may assume 
that D 3 eK F . 

By Observation Q] we find that F33 = F22 = 1 ■ As F22 — 1 , Observation implies that 
F31 = F13. Similarly, F33 = 1 implies that F12 — Fu. Putting all equations together we obtain 
F12 = F21 = — -F13 = — -F31 = — -Fi2j implying they are all zero. Thus 

- 

+ - I = I ] I B 2 I I I . (25) 

0+4 

A.3 The case \K F n C a \ < 2 for all a G Sym(3) 

Let again B C 7?-f consist of nine affinely independent matrices. As |7£f| > 9, |72.f H C ct | = 2 
for at least three permutations cr. W.l.o.g. we can assume that two of those permutations are 
odd permutations and that they are equal, say, to 04 and erg (replacing if necessary F by an 
equivalent facet). Further we may assume IZf contains the following two matrices of C CT4 : 

(26) 

This can be seen using the following two mappings X t— > C2XC2 (with C2 defined as in H19(| s | 
and X ^^ CX (with C G C ffl ) which permit to map any subset of size 2 of C a4 to any other such 
subset and which preserve C a6 as well. We choose the basis B containing the two matrices of 
H26fl . ^From Observation2]we find Fu = —1 and F23 = — F32 7^ ±1; the latter inequality follows 
from the fact that \1Zf H C a4 \ = 2 combined with Observational As Fu = — 1, by Observation 

El 

' - \ / - \ 

- , + \ elZ F nC ai (27) 

oo+y \o -/ 

and Observation^] implies F22 = F33 ^ ±1. At most one of the two matrices in IJ27II belongs to 
B since they are affinely dependent with the matrices in (|26|l . Say, \B n C ai | = 1. 
Let us now examine which two matrices of C CT6 belong to TZf ■ Set 





The two mappings X >-^ XC5 and X 1— ► C5X preserve the set of matrices in (|26|) and permit 
to map any other matrix of C<j 6 to the matrix X\. Therefore we can assume w.l.o.g. that 
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X\ G TZf H C a6 . The second matrix of IZf H C CT6 does not have entry — 1 at the position (2, 2) 

since, otherwise, .F22 = — 1 contradicting an earlier claim. Hence the second matrix in 7?.^ n C,j 6 

is 

/0 +\ /0 

X 2 := + , or X s := + 

V- 0/ \+ 0, 

1. Consider first the case when A 2 € 7^_f H C ae . Then, F31 = —1 and F22 = — F13 7^ ±1. As 
-F31 = — 1, we have 

0-0 

+ ) € n F n C CT3 (28) 

- 

and F12 = -F23 7^ ±1. As £> contains at most three of the matrices X\, X2 and in (|28|l . we 
must have |2?nC CT2 1 = 2 or |SnC CT5 1 = 2. We obtained eralier that F 33 = F 22 =-F 13 ^±l 
and F12 = F23 = — F32 7^ ±1. In other words, the second and third columns of F contain 
no entry ±1. On the other hand, the two matrices from BnC ai (i = 2, 5) have one common 
nonzero entry which therefore is located in the first column, at the position (2, 1). This 
implies F21 = ±1. 

(a) If i*2i = 1, then Observation implies F12 — F 33 and F13 = — F32. Combining with 
the former relations on entries of F, we find 





F = + I . (29) 

(b) If F21 = — 1, then in the same way we find 

(~ ° 

F = - I . (30) 




In both cases we find that F is equivalent to B\ . 
2. Consider now the case when A3 € Up n C a6 . Then, F13 = — 1, F22 = — -F31 7^ ±1, 

- \ 

g n F n c CT2 (31) 

- / 

and F21 = F 3 2 7^ il. In the same way as in the first case one finds that F is equivalent 
to Bf. 

This concludes the proof of Theorem 
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